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Abstract 

Multiple default inheritance formalisms for lex- 
icons have attracted much interest in recent 
years. I propose a new efficient method to access 
such lexicons. After showing two basic strate- 
gies for lookup in inheritance lexicons, a com- 
promise is developed which combines to a large 
degree (from a practical point of view) the ad- 
vantages of both strategies and avoids their dis- 
advantages. The method is a kind of (off-line) 
partial evaluation that makes a subset of inher- 
ited information explicit before using the lex- 
icon. 1 identify the parts of a lexicon which 
should be evaluated, and show how partial eval- 
uation works for inheritance lexicons. Finally, 
the theoretical results are confirmed by a com- 
plete implementation. Speedups by a factor of 
10-1 00 are reached. 



1 Introduction 
1.1 Motivation 

In recent years, lexicons based on inheritance for- 
malisms have attracted much interest in compu- 
tational linguistics. These formalisms allow to de- 
velop highly structured lexicons with only small 
redundancy. The property of small redundancy 
leads to positive consequences, e.g., easy con- 
struction, modification, and extension of lexicons 
and support for maintaining consistency and con- 
ciseness. 

Besides the advantages following from nonre- 
dundant lexicons, inheritance lexicons are theo- 
retically attractive as they identify linguistically 
significant classes of lexemes from the vast set of 
all possible classes (cf . ( [Flickinger fc Nerbonnc 92 , 
p. 283)). Finally, descriptive parsimony, which 
can be achieved by inheritance lexicons, is always 
an advantage. 

Despite the increasing popularity of inheritance 
lexicons, lookup for these lexicons is often ineffi- 
cient, or at least not as efficient as possible. There 



* Partial evaluation as described in this paper has been 
successfully applied to the lexicon in the Virtuelle Wis- 
sensfabrik (Virtual Knowledge Factory), a project of the 
German state Nordrhein-Westfalen, which supported this 
research in part. 



are two basic strategies for access to inheritance 
lexicons: 

1. Calculate the extension of a lexicon before ac- 
cess, i. e., the extension of each class describ- 
ing a single lexeme (a lexical class) is com- 
pletely calculated, yielding all word forms of 
this lexeme^, each with complete lexical in- 
formation from the class itself and its direct 
and indirect superclasses. This information, 
standardly represented by some kind of fea- 
ture structure, is stored in a new expanded 
lexicon. The size of the resulting full-form 
lexicon is maximal; lookup time is minimal, 
however, because lookup amounts to index 
search. 

Even if a full-form lexicon can be realized 
for some applications, it is — from a scientific 
point of view — still desirable to achieve sig- 
nificant efficiency improvements if they are 
possible through a reasonable amount of ad- 
ditional work and resources. 

2. Calculate an index (or several indices) be- 
fore access and look up words by using this 
index (cf. dRussell et al. 92 )).p| For language 
analysis, an index from word forms to cor- 
responding lexical classes is needed. Lookup 
of a word in such an indexed inheritance 
lexicon comprises one index search plus ex- 
tending (normally) one class. 

A lookup in an indexed inheritance lexicon is 
slower than a lookup in a full-form lexicon, be- 
cause the latter kind of lookup consists only of an 

1 lt is assumed that the lexicon contains morphological 
information to account for inflection. If the lexicon de- 
scribes only base forms and morphology is treated by a 
separate component, a lexeme is linked to only one fea- 
ture structure, which is defined by multiple inheritance. 
Besides this difference, all arguments about the approach 
presented in this paper remain valid. 

2 An extreme access strategy is to calculate the exten- 
sion of the lexicon for each lookup. An anonymous reviewer 
pointed out to me (among other improvements) that this 
strategy is for most applications inadequate. 



index search. In this paper, partial evaluation of 
inheritance lexicons is explained which speeds up 
lookup significantly (compared to indexed inheri- 
tance lexicons). 

1.2 Inheritance-based lexicon formalism 

In this paper, the lexicon formalism IBL 
(inheritance-based lexicon formalism), 

which (Hartrumpf 96) has developed and im- 
plemented, is used to illustrate the approach 
presented in this paper. This formalism is 
general and powerful and has been successfully 
applied to partially evaluate several lexicons. 
The results of this paper can be transferred to 
other lexicon formalisms with similar inheritance 
concepts (e.g., ELU's formalism, ( [Russell et al 



pj] ; Russell et al. 92|) ) quite easily. IBL's basic 
concepts are summarized in the following as far 
as it is needed to understand partial evaluation 
for inheritance lexicons. 

IBL is heavily inspired by the lexicon formalism 
of the Environnement Linguistique d 'Unification 
(ELU), which uses multiple default inheritance as 
described by ( Russell et al. 91 ; Russell et al. 92|) . 
IBL differs from ELU's formalism in four main 
ways: feature structures may contain complex dis- 
junctions and complex negations (not just atomic 
ones); predicative constraints can use coroutining 
to wait for arguments to become sufficiently in- 
stantiated (for coroutining in logic programming 
cf. ( park fc McCabe 79| ; |SICStus 95|) ); a class can 
decide where in the feature structure to inherit 
information from a superclass (One may call this 
locating inheritance.); IBL is strongly typed, 
i. e., all feature structures must have a type, while 
types are optional in ELU. 

An IBL lexicon consists of type, generator and 
class definitions. Types are defined to type fea- 
ture structures. 

IBL's most important concept are classes. A 
class is denned by its name, a list of direct su- 
perclasses (superclass list), a (possibly empty) 
main feature structure containing definite in- 
formation, a (possibly empty) default feature 
structure containing default information, and a 
set of variant feature structures defining a set 
of variants. Variants are mutually exclusive al- 
ternatives, which can be used to describe differ- 
ent inflectional forms, for instance. The following 
class a, which describes basic properties of Ger- 
man adjectives, exemplifies the use of main, de- 
fault, and variant feature structures. 



class a 
main 

syn" cat = a, 

concat (mor "astern, mor" suffix, form) 
default 

lemma = mor "stem 
variant 

syn "cdegree = pos, 

mor' astern = mor"posstem 
variant 

syn" cdegree = comp, 

mor "astern = mor" comp stem 
variant 

syn "cdegree = sup, 

mor "astern = mor"supstem. 

This class states in its main feature structure that 
all adjectives belong to category a and word forms 
(feature form) of adjectives are always the con- 
catenation (predicate concat) of a specific stem 
mor "astern and the suffix mor "suffix. In the 
next feature structure, it is expressed that the 
lemma is normally equal to the morphological 
stem. Finally, three variants prepare the genera- 
tion of positive, comparative, and superlative ad- 
jective forms, all of which are realized within word 
boundaries. The given class is a nonlexical class 
as it is not a class that provides word forms for a 
specific lexeme (lexical class). This distinction 
is made by ( Russell et al. 9j| ). 

The default information coming from a super- 
class can be overwritten in subclasses. Inher- 
itance conflicts are solved by prioritized inher- 
itance which uses the concept of class prece- 
dence lists (CPL) as described for the Common 
Lisp Object System (CLOS) by QKeene 89|) f| The 
information of a superclass can be inherited by a 
subclass at a certain position (locating inheri- 
tance). The information inherited can not be re- 
stricted or selected (( Hartrumpf 96|) calls this se- 
lective inheritance, which is allowed in DATR 
(|Evans & Gazdar gg ).) b ecause this contradicts 
the principle of data encapsulation and may lead 
to unwieldy and poorly structured lexicons. To 
calculate the extension of a lexical class, one adds 
to it the information from main/variant feature 
structures of its superclasses by unification and 

3 The CPL of a class can be derived by topologically 
sorting its direct and indirect superclasses, so that classes 
that appear together in a superclass list of these super- 
classes keep their order in the CPL and subclasses always 
precede their superclasses. 



the information from default feature structures of 
its superclasses by default unification. 

IBL's third kind of lexical definitions are gen- 
erators. A generator is a rule that can be used to 
generate a new lexical class from another lexical 
class. Such a rule is applied to an element of the 
extension if a special feature gener is set to the 
generator's name. 

In the next section, it is explained how a mul- 
tiple inheritance hierarchy of classes can be par- 
tially evaluated. 

2 Partial evaluation of inheritance 
lexicons 

2.1 Background 

Lexicon lookup in indexed inheritance lexicons 



(as described in section LI) consists of two op- 
erations: first, searching for the relevant class]] 
in the index; second, calculating the extension of 
this class. As the latter step consumes much more 
time than the first, it is the main target for im- 
provements. 

The idea is to precompute the extension of lexi- 
cal classes to reduce lookup time. This precompu- 
tation represents a kind of (off-line) partial eval- 
uation for inheritance lexicons (with respect to 
lexicon lookup). In general, there is a trade-off 
between the size of the partially evaluated lexi- 
con (pe-lexicon) and the work that remains to 
be done at lookup time. This trade-off can be 
characterized by the gap between completely ex- 
tended inheritance lexicons and indexed inheri- 
tance lexicons. 

In this paper, a compromise for this trade-off 
is proposed that yields both: speedup of lexicon 
access by a factor of 10-100 (compared to indexed 
inheritance lexicons) and moderate lexicon size. 

2.2 Partial evaluation without defaults 

For our partial evaluation method, indexed in- 
heritance lexicons were the starting point. For 
indexed inheritance lexicons, the extension of a 
lexical class c is calculated at lookup time accord- 
ing to algorithm 1 in Figure [l]. 

Step 2 will be discussed in section 23. The 
idea of our approach is to precompute some of the 
unification operations in algorithm 1. Two con- 
straints have to be obeyed: first, the number and 



4 To simplify the discussion, it is assumed that only one 
class is relevant for a given query. All arguments remain 
valid if several classes are found in the index. 



The extension ext(c) of a lexical class c with CPL 
cpl(c) = (c\C2 ■ ■ ■ c n ) is calculated in two steps: 
Let Mj, Dj, Vi (1 < i < n) be the main feature 
structure, the default feature structure, and the 
set of variant feature structures, respectively, of 
class Cj. (The big unification operator ( n ) is a 
unification over two sets of feature structures, i. e., 
for two sets A and B of feature structures, AYA B 
is defined as follows: 

AHB := {F A nF B \F A eA,F B eB}\ {±} 
n is a default unification operator.) 

1. ext s {c) : = {Mi}n Vy n 

{M 2 } UV 2 U ... {M n } n v n 

2. ext{c) := (...((ext s (c)f\D 1 )nD 2 ) 

...r\D n ) 

Figure 1: Algorithm 1 



size of pe-results (results of partial evaluation, 
which (together with the lexical classes) form 
the pe-lexicon) must be tolerable; second, there 
should remain only little to do during lookup. 

If one optimizes in favor of the second con- 
straint by doing all unification operations for all 
lexical classes (This is the complete precomputa- 
tion strategy), the number and size of pe-results 
is unacceptable. But if one excludes the lexical 
class itself from partial evaluation (i.e., only the 
unification operations printed in boldface ( HI ) in 
algorithm 1 are performed during partial evalua- 
tion), the number of pe-results shrinks to a toler- 
able magnitude, especially as in a well structured 
inheritance lexicon the following holds (Ci is the 
set of all lexical classes in the lexicon): 

n cp i ■= \{cpl(c) - c\c £ Ci}| < 
\{cpl(c)\c € d}| = |d| =: ni 

For the lexicon further discussed in section n\ 
is 1510, while n cp i is 225. 

Empirical results (see Table [l] below) confirm 
that the number n cp i doesn't increase much when 
the number ni of lexical classes in the lexicon ex- 
ceeds a certain number, or to be more specific, 
n>cpi/ n i approaches zero for growing n/. There- 
fore, the number of parts to be evaluated will be 
tolerable, if one precomputes only the unification 
operations that are in boldface in algorithm 1. 



At lookup time, only the information contained 
in the feature structures of the current lexical 
class in the pe-lexicon has to be added by uni- 
fication to the result of partial evaluation (which 
can be considered to be the only superclass of 
that lexical class in the pe-lexicon), i. e., only the 
unification operations that are not in boldface in 
algorithm 1 have to be calculated. The number 
of unifications which are precomputed for the ex- 
tension of a lexical class is often high (see Table ^ 
below) as variants may lead to exponential growth 
of this number. 

2.3 Partial evaluation with defaults 

The approach presented so far is only correct 
if the operations used in calculating extensions 
are associative and commutative because the nor- 
mal order of operations (bottom-up) is changed 
by partial evaluation. These conditions are met 
by the (nondefault) unifications used in step 1 
of algorithm 1. However, to add defaults (cf. 
step 2 of algorithm 1), normally a default unifica- 
tion operation is used which is neither associative 
nor commutative. This is also true of the pri- 
oritized default unification which is used in IBL 
and which assigns a priority to all atomic feature 
structures of a default feature structure as defined 
by ( Hartrumpf 96| ).p1 

To solve the correctness problem for defaults, 
I use a simple strategy. They are not treated 
during partial evaluation. Instead, one stores the 
atomic feature structures of default feature struc- 
tures in the pe-lexicon and combines them with 
the pe-result (by prioritized default unification) 
at lookup time. 

To sum up partial evaluation for inheritance 
lexicons, algorithm 1 in Figure [l] for calculating 
the extension of a lexical class is separated in a 
partial evaluation part (the algorithm in Figure ^ 
for calculating a pe-lexicon) and a lookup part 
(see next section). 

Partial evaluation of inheritance lexicons is ex- 
emplified by its effects on the morphosyntactic de- 
scription of German adjectives like klein (small) 
in Figure [3|. Classes appear below their di- 
rect superclasses and are connected by directed 
edges. Class a is a graphical representation of the 
class definition in section 11.21 Its direct subclass 



for all c € Ci do: 

1. Let cpl(c) = {c.\c% . . .c n ). Let Mi, D- n Vi (1 < 
i < n) be the main feature structure, the default 
feature structure, and the set of variant feature 
structures, respectively, of class q. 

Calculate and store as a pe-result (if not already 
done for another class whose CPL differs only in 
Pi): 

{M 2 } n V 2 n . . . {M n } n V n , and 
{D 2 ... D n ) 

2. Store for class c a reference to this pe-result, 
and Mi, Vi, D\ (the feature structures of class 
c). 

Figure 2: Algorithm for partial evaluation 



aJorms contains (besides some defaults) variant 
feature structures that describe the possible suf- 
fixes of German adjectives. A lexical class like 
klein inherits from classes aJorms and a indirectly 
through the class a_decl which contains typical 
values for the construction of predicative forms of 
adjectives. The class klein itself contains only the 
idiosyncratic information that the morphological 
stem for this adjective is klein. The right side 
of Figure |3] shows that partial evaluation for in- 
heritance lexicons unfolds the lexical hierarchy so 
that the hierarchy becomes flat. 

2.4 Partial evaluation and lexical rules 

Lexical rules are considered important in many 
lexicon formalisms. Although some lexical rules 
can be replaced by using the variant concept (cf. 



3 There are associative and commutative default unifica- 



section 1.2), there remain cases where lexical rules 
are useful, e. g., for derivational morphology. The 
role of lexical rules is played in IBL by generators 
(cf. section 1.2). A generator consists of two parts: 
a superclass list and a mapping from the current 
feature structure (the input feature structure) to 
a new feature structure (the output feature struc- 
ture). A straightforward treatment of generators 
by partial evaluation is to store all these out- 
put feature structures as new lexical classes (con- 
taining only a main feature structure) in the pe- 
lexicon and to treat the CPL corresponding to 
the superclass list of a generator like a CPL of a 
lexical class .R 



tion oper ations (cf. ( Lascarides & Copestake 95 



Lascaride. 1 



|e£ al. 96 )); however, they might be too inefficient tor ap 



plications that heavily use default unification. 



6 In some systems, it can be useful to delay the applica- 
tion of certain lexical rules. 



Classes 



class a 
main 

default 



FORM COncat(MOR | A.STEM, MOR | SUFFIX) 

SYN | CAT a 
LEMMA 01 
MOR | STEM [H] 

r a_stem m 

MOR r— i 

|_POS_STEM [3 

SYN | CDEGREE pos 

[a_stem f 

MOR r— i 

|_COMP_STEM LiJ 

SYN | CDEGREE COmp 

[a_stem UJ 

MOR i—i 

[SUP.STEM LiJ 

SYN | CDEGREE sup 



class a_forms 

default % d a _f orms omitted 
MOR I SUFFIX "e" 



CASE 


nom 


V 


GEND 


fern 




DECL 


1 




CASE 


nom" 




NUM 


sg 




DECL 


2 




CASE 


acc 




NUM 


sg 




GEND 


-imasc 


DECL 


2 




CASE 


nom 


V 


NUM 


sg 




GEND 


fern 




DECL 


3 





variant 
variant 
variant 
variant 
variant 



USE attr 

% variant for attributive suffix -era omitted 
% variant for attributive suffix -en omitted 
% variant for attributive suffix -er omitted 
% variant for attributive suffix -es omitted 
% variants for predicative suffixes omitted 



1 




class a_decl 

[PRED.O +1 

main MOR 

[ [PRED.E - J 


] 


/ 




lexical class klein (ID 466) 

main [ MOR | STEM "klein"] 



pe-result 71 % corresponds to cpl (klein) — klein 
% — (a_decl a_forms a) 

1: % positive form with suffix -e 
FORM concat(MOR | A.STEM, MOR | SUFFIX) (delayed)' 

A_STEM [3 
POS-STEM 
PRED.O + 
PRED_E 

SUFFIX "e' : 



CASE 


nom 


V 


acc" 


GEND 


fern 




DECL 


1 




CASE 


nom" 




NUM 


sg 




DECL 


2 




CASE 


acc 






NUM 


sg 






GEND 


kinase 




DECL 


2 






CASE 


nom 


V 


acc" 


NUM 


sg 




GEND 


fern 




DECL 


3 





CAT a 

CDEGREE pos 

USE attr 
% positive form with suffix -em omitted 



6: % predicative positive form with suffix -e omitted 
7: % comparative form with suffix -e omitted 



18: % predicative superlative form with suffix -sten omitted 
[lemma U]l\ 

.forms, [ M[)R | STEM [Tjjy 



defaults 



lexical class klein (ID 466) 

main [ MOR | STEM "klein"] 



Index 



klein 


i-> 466 


klein h 


466, {6} 


klcinc 


i '? 466 


klcinc h 


466, {1} 


klcincm 


i ^ 466 


klcincm h 


466, {2} 


klcincn 


i * 466 


klcincn h 


466,{3} 


kleiner 


i ^ 466 


klcincr h 


466,{4,12} 


klcincrc 


i * 466 


klcincre h 


466,{7} 


klcinerem 


i * 466 


klcinerem h 


466,{8} 


klcincrcn 


i-> 466 


klcincren h 


466, {9} 


klcincrcr 


i * 466 


klcincrcr h 


466, {10} 


klcincrcs 


i ^ 466 


klcincrcs h 


466, {11} 


klcincs 


i * 466 


klcincs h 


466, {5} 


klcinstc 


' * 466 


klcinste i- 


466, {13} 


klcinstcm 


i ^ 466 


klcinstem h 


466, {14} 


klcinstcn 


i-> 466 


klcinsten h 


466,{15,18} 


klcinster 


i ^ 466 


klcinster h 


466,{16} 


klcinstcs 


i * 466 


klcinstcs h 


466,{17} 



;ure 3: Effects of partial evaluation for the German adjective klein (left side: original lexicon; ri; 
side: pe-lexicon) 



3 Access to partially evaluated 
lexicons 

A pe-lexicon consists of lexical classes and pe- 
results. Each lexical class refers to one pe-result 
that corresponds to its CPL (without the lexi- 
cal class itself). A pe-result p comprises a list 
Pf of feature structures and a list p,i of default 
atomic feature structures (cf. upper right part of 
Figure ||) . 

To access a pe-lexicon, an index is used which 
is identical to the one used for the original in- 
dexed inheritance lexicon, i.e., a mapping from 
a set of key features to the classes whose exten- 
sions contain relevant information. For language 
analysis, this set of key features usually consists 
only of the (orthographical or phonological) form 
feature; such an index is assumed in the follow- 
ing. After searching for the relevant lexical class 
c in the indexQ, the extension of class c has to be 
determined. First, the pe-result pj for class c is 
retrieved from the pe-lexicon. Second, the main 
and variant feature structures of class c are added 
by unification. Third, the default atomic feature 
structures of class c and the default atomic fea- 
ture structures in p,i are concatenated and added 
by default unification. These steps lead to the 
(complete) extension of class c. 

The approach described is slower than neces- 
sary when pj contains many feature structures 
(as it is often the case for highly inflected lan- 
guages) because the second and third step must 
be done for every member of pf. This situation is 
substantially improved by considering only those 
elements of pf that are relevant for the word form 
it; of a given query. In order to follow this strat- 
egy, the index is extended. A word form index 
has the signature W — ► Ci x 2 N (instead of just 
W — > Ci; W is the set of all word forms, Ci is the 
set of all lexical classes in the lexicon). The addi- 
tional set S of natural numbers in an index entry 
w i ^ (c, S) identifies those elements of pj (of the 
pe-result for cpl(c) — c) that are relevant for the 
given word form w. Figure |3| shows a part of an 
index for an indexed inheritance lexicon and the 
corresponding part of an index for a pe-lexicon, 
and Figure || contains the complete algorithm for 
lookup in pe-lexicons. 



1. w i — > (c, (S) = index entry for word form w 
(Let the lexical class c refer to a pe-result p 
consisting of feature structures pj and atomic 
default feature structures pa = (-D1D2 ■ ■ ■ D n ).) 

2. Calculate for the feature structures PE 
corresponding to feature structures S in pj 
the following (M c , D c , V c are the main feature 
structure, the default feature structure, and the 
variant feature structures for the lexical class c, 
respectively): 

1. ext' s {c) := {M c } \1V C \1PE 

2. ext'(c) := (. . . (({ext' s (c) nD c ) nDi) n 

D 2 )...nD n ) 

Figure 4: Algorithm for lookup in pe-lexicons 



4 Performance of partially evaluated 
lexicons 

4.1 Performance of partial evaluation 

As partial evaluation is done in a separate compi- 
lation step, the size of the resulting pe-lexicon is 
much more important than the runtime (which is 
linear in the number n\ of lexical classes) . A pe- 
lexicon^] contains in addition to the lexical classes 
of the original lexicon (with superclass lists re- 
placed by references to pe-results, cf. section |2.2| ) 
a set of pe-results. If one follows the strategy de- 
scribed in section the size of the pe-lexicon 
grows only linearly to n cp i. 

The number n cp i is bounded by the number 
n\ of lexical classes; in realistic lexicons, n cp i is 
much smaller than rii, as explained in section Ifl. 
Therefore, the size increase for pe-lexicons is tol- 
erable as reflected in Table |]. The lexicons of this 
table were created by adding lexemes in the order 
of a frequency list (see flRosengren 77 , pp. 165- 
219)), which is based on a German newspaper cor- 
pus. 



4.2 Performance of lexicon lookup 

A lookup in a pe-lexicon comprises two steps: an 
index entry is searched and a stored partial exten- 
sion is completed. The first step can be done by 
standard methods from computer science, e.g., 



7 Again, I simplify by describing only cases with one 
relevant class. 



In order not to restrict the generality of the empirical 
results in section ^, the specific treatment of generators (cf . 
section 2.4) is excluded here. 



Lexicon 



pe-Lexicon Size Increase 



m 


n n 




nif s 


te (S) 


t e /ni (s) 


nif s 


ri; f 3 (pe— lexicon) 
n l fa (lexicon) 


336 


96 


107 


724 


24.8 


0.074 


1753 


2.42 


755 


118 


158 


1196 


48.2 


0.064 


3005 


2.51 


1510 


145 


225 


2003 


94.3 


0.062 


5193 


2.59 



Legend 

ni number of lexical classes 
n n number of nonlexical classes 

n cp i number of different CPLs of lexical classes (excluding the lexical class itself) 
nifs number of feature structures in the lexicon 

t e evaluation time for producing the pe-lexicon (SICStus Prolog 3 on a SUN Ultra 1) 
Table 1: Experimental results for partial evaluation 



by using hash tables which allow constant time 
search. 

During the second step, one main feature struc- 
ture and several variant feature structures have 
to be added by unification and the defaults of 
the pe-result and of the lexical class have to be 
added by default unification (as specified in Fig- 
ure In addition, there might be predicative 



co nstraints that had to be delayed during partial 
evaluation. The coroutining mechanism will initi- 
ate their evaluation during this step. This is the 
worst case; a typical lexical class comprises only a 
small main feature structure, if the lexicon is well 
structured |] 

The results in Table [2] show that partial evalu- 
ation achieves significant speedups, especially for 
complex lexical classes. On the average, lookup in 
a pe-lexicon is 60 times faster than in the original 
indexed inheritance lexicon. 

5 Related work 

Partial evaluation is a technique that has been 
applied — in different ways — in several areas: in 
logical, functional, and imperative programming 
(see for instance ( [lones et al. 93[ ) for a thor- 



ough introduction and ( Danvy et al. 96 ) for cur- 
rent research); for efficient access to methods in 



9 If the feature structure that is to be added by uni- 
fication contains only few atomic feature structures, the 
operation is sped up, as the operation used need not be 
(and is not) a full unification: the atomic feature struc- 
tures only have to constrain the current intermediate re- 
sult of calculating the extension of a class; the other way 
of constraining — as it is required by full unification — is not 
necessary. 



programming languages with single inheritance 
( flKhoo &; Sundaresh 91] )); etc. The problem of 
partial evaluation for inheritance lexicons differs 
in two ways: while behavior is limited to lookup 
functions, the data is complex due to its large 
size and high expressivity (multiple inheritance, 
defaults, disjunctions, negations). 



In the context of inheritance lexicons, flCopes- 



take 93b, p. 240) mentions for the LKB (lexical 



knowledge base, ( |Copestake 93a|) ) that expanded 
classes (expanded psorts in LKB) can be cached, 
but does not provide any details. If inheritance 
proceeds top-down, as in the LKB, partial evalua- 
tion will be somewhat easier because the informa- 
tion that is added from the lexical class has always 
lower priority than the information of its super- 
classes and thus defaults don't have to be treated 
specially by partial evaluation. However, this 
practice is for most applications counter-intuitive 
(although it is justified for the original domain 
of LKB, semiautomatic extraction of lexical in- 
formation from machine-readable dictionaries, as 
(|Copestake 93b , p. 239) shows) because normally 
a class should have precedence over its super- 
classes. 

6 Perspectives 

The results for partial evaluation of inheritance 
lexicons presented in this paper improve the prac- 
tical use of such lexicons significantly. Interesting 
questions for further research remain, however. 

The number of pe-results might be too high 
for some combinations of lexicons and applica- 
tions. To reduce this number, the set of classes 
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Lexicon 



pe-Lexicon 



Speedup 
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nfs 
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ti (ms) 
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n a f s 




ti (ms) 


t[ (lexicon) 
t[ (pe — lexicon) 


wegen (preposition) 
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8.0 
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3 





0.7 


11.4 


Kreises (noun) 
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18 


52 


31.6 
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8 





1.2 


26.3 


schonste (adjective) 


4 


72 


332 


168.8 
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9 


1 


4.9 


34.4 


laufe (verb) 


7 


538 


2204 


385.1 


2 


24 


2 


5.3 


72.7 


average (1510) 


3.63 


153 


631 


133.2 


2.79 


10.06 


0.44 


2.2 


60.5 



Legend 

n s number of superclasses 

rif s number of feature structures in superclasses (added during extending) 
n a f s number of atomic feature structures in superclasses (added during extending) 
ti lookup time (SICStus Prolog 3 on a SUN Ultra 1) 
ridp number of delayed predicates 

Table 2: Experimental results for lexicon lookup 



can be partitioned. According to these partitions, 
the CPL of a class is split up in parts and only 
those partial CPLs are evaluated. During lexi- 
con lookup, it will be necessary to combine re- 
sults from different partial CPLs. This combi- 
nation will be easier if the class partitions are 
orthogonal. For example, if, in the lexicon at 
hand, morphosyntactic and semantic information 
don't interact much, one may use a class partition 
morsyn and a class partition sem. 

Two more questions arise from the first: How 
can useful partitions of classes be linguistically 
defined or automatically determined? And: How 
do these partitions influence lookup time and size 
of pe-lexicons? 

Another perspective is the use of results 
from partial evaluation to infer new linguisti- 
cally relevant classes in inheritance lexicons semi- 
automatically. 
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